Efficient Language Model Construction for Spoken Dialog Systems by Inducting Language Resources of Different Languages

نویسندگان

  • Teruhisa Misu
  • Shigeki Matsuda
  • Etsuo Mizukami
  • Hideki Kashioka
  • Haizhou Li
چکیده

Since the quality of the language model directly affects the performance of the spoken dialog system (SDS), we should use a statistical language model (LM) trained with a large amount of data that is matched to the task domain. When porting an SDS to another language, however, it is costly to re-collect a large amount of user utterances in the target language. We thus use the language resources in a source language by utilizing statistical machine translation. The main challenge in this work is to induct automatic speech recognition results collected using a speechinput system that differs from the target SDS both in the task and the target language. To select appropriate sentences to be included in the training data for the LM, we induct a spoken language understanding module of the dialog system in the source language. Experimental construction using over three million user utterances showed that it is vital for selecting from the translation results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

A flexible and integrated interface between speech recognition, speech interpretation and dialog management

This paper presents an integrated interface between speech recognition, speech interpretation and dialog control intended for spoken dialog systems coping with natural speech input. During the system design phase the interface co-ordinates corpus acquisition and annotation, grammar development and the construction of stochastic hierarchical language models. During system runtime, it links toget...

متن کامل

A Flexible and Integrated Interface be Speech Interpretation and Di

This paper presents an integrated interface between speech recognition, speech interpretation and dialog control intended for spoken dialog systems coping with natural speech input. During the system design phase the interface co-ordinates corpus acquisition and annotation, grammar development and the construction of stochastic hierarchical language models. During system runtime, it links toget...

متن کامل

Stochastic Bi-Languages to model Dialogs

Partially observable Markov decision Processes provide an excellent statistical framework to deal with spoken dialog systems that admits global optimization and deal with uncertainty of user goals. However its put in practice entails intractable problems that need efficient and suboptimal approaches. Alternatively some pattern recognition techniques have also been proposed. In this framework th...

متن کامل

Design and Evaluation of Spoken Dialog Systems

Interactive spoken dialog systems extend the range of automated telecommunication services beyond simple limited-choice form-filling applications to goal-directed tasks covering richer, more complex domains. Creating effective and efficient dialog systems requires not only accurate ancl robust speech recognition and language modeling, but also iterative, principled design of the user interface ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014